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We consider the task of deterministically entangling two remote qubits using joint measurement 
and feedback, but no directly entangling Hamiltonian. In order to formulate the most effective 
experimentally feasible protocol, we introduce the notion of average sense locally optimal (ASLO) 
feedback protocols, which do not require real-time quantum state estimation, a difficult component 
of real-time quantum feedback control. We use this notion of optimality to construct two protocols 
which can deterministically create maximal entanglement: a semiclassical feedback protocol for low 
efficiency measurements and a quantum feedback protocol for high efficiency measurements. The 
latter reduces to direct feedback in the continuous-time limit, whose dynamics can be modeled 
by a Wiseman-Milburn feedback master equation which yields an analytic solution in the limit of 
unit measurement efficiency. Our formalism can smoothly interpolate between continuous-time and 
discrete-time descriptions of feedback dynamics, and we exploit this feature to then derive a superior 
hybrid protocol for arbitrary non-unit measurement efficiency that switches between qnantum and 
semiclassical protocols. Finally, we show using simulations incorporating experimental imperfections 
that deterministic entanglement of remote superconducting qubits may be achieved with current 
technology using the continuous-time feedback protocol alone. 


1. INTRODUCTION 

Engineering of quantum devices requires optimization 
of two essentially contradictory requirements. On one 
hand, quantum properties such as superposition and en¬ 
tanglement upon which these devices rely are fragile, and 
require careful isolation from external degrees of freedom. 
On the other hand, control and measurement of a system 
requires coupling to an external device, which often runs 
contrary to the necessity for decoupling from the environ¬ 
ment. To balance this trade-off, many quantum systems 
relevant for quantum computing and sensing lack readout 
capabilities that are effectively instantaneous and pro¬ 
jective. Instead, measurement occurs over a finite, re¬ 
solvable time scale in such systems. Recent research has 
taken advantage of this by utilizing the fact that continu¬ 
ous weak measurement enables direct observation of the 
continuous time evolution of a quantum system (quan¬ 
tum trajectories) [1-4], and also permits operations on 
the system during the measurement process, including 
feedback and feed-forward control [5-8]. 

Real-time quantum feedback control is expected to be 
broadly applicable to many problems in quantum infor¬ 
mation science. Some quantum information applications 
that have been proposed to date include, rapid purifica¬ 
tion of qubits or qubit registers [9-14], quantum error 
correction [15, 16], transmission of quantum information 
through noisy channels [17], adaptive measurement for 
quantum state discrimination [18-20], and several forms 
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of quantum state preparation and stabilization [21-25]. 

Experimentally, quantum feedback has been mostly 
demonstrated to date with atomic, molecular, or optical 
systems, e.g., [6, 7]. However, the advent of quantum- 
limited microwave amplifiers has recently enabled exper¬ 
imental realization of quantum feedback in superconduct¬ 
ing circuits, where it has been used to stabilize the Rabi 
oscillations of a single qubit [8] and to deterministically 
create entanglement of two qubits within a single cavity 
[26] using a discrete feedback loop. 

Several works have also suggested using quantum feed¬ 
back to enhance generation of remote entanglement [27- 
30]. These papers have considered the case in which 
the controller has access to a joint measurement on a 
pair of qubits that are too far apart to engineer a direct 
or photon-mediated interaction, an important scenario 
for quantum networks or large-scale quantum computers 
[31]. The remote aspect requires that the quantum feed¬ 
back operations be restricted to local unitaries, which 
cannot on their own generate entanglement [32]. Con¬ 
versely, a joint measurement alone cannot deterministi¬ 
cally project a separable system into a fixed entangled 
state, but access to this measurement and local unitary 
feedback can [33]. 

In this work, we build upon this literature in our goal 
to achieve optimal protocols for remote entanglement, 
focusing in particular on joint measurements that have 
been implemented in superconducting qubits [4] . We mo¬ 
tivate our approach by an examination of optimality for 
the state update in a single, discrete time step and de¬ 
fine an average sense locally optimal (ASLO) strategy 
that makes the state update over this discrete time step 
using the most recent measurement outcome and knowl¬ 
edge of the average state. We show that this discrete time 
step protocol reduces to a direct feedback protocol in the 
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continuous-time limit, whose dynamics can be modeled 
by a Wiseman-Milburn feedback master equation with 
an analytic solution that yields a simple, closed-form ex¬ 
pression for the locally optimal quantum feedback in this 
limit. This analytic solution shows that for unit measure¬ 
ment efficiency, the fidelity with respect to the target en¬ 
tangled state asymptotes exponentially to 1. The discrete 
time step optimality study also motivates a semiclassical 
protocol that is more effective for low measurement ef¬ 
ficiencies, provided that one takes longer time steps be¬ 
tween applications of feedback. This feedback strategy is 
capable of fully entangling the qubits even for non-unit 
efficiency, but creates entanglement more slowly than the 
continuous case. Both semiclassical and quantum proto¬ 
cols asymptote to unit entanglement fidelity under unit 
measurement efficiency. In the case of non-ideal measure¬ 
ments, the preferred form of the optimal feedback proto¬ 
col is found to depend on the measurement efficiency ry, 
with the semiclassical/quantum protocol being preferred 
for low/high rj values. We then show that for arbitrary 
non-unit efficiency, continuously tuning the measurement 
time leads to a protocol that can surpass this ASLO strat¬ 
egy and that switches from using the quantum protocol at 
short times to the semiclassical protocol at larger times. 

The remainder of the paper is organized as follows. In 
section 2 , we describe the measurement that conditions 
our feedback using both stochastic master equation and 
POVM descriptions. Section 3 derives the optimal feed¬ 
back unitary for a single discrete time step as a function 
of the prior state of the qubits. We first consider the 
case of inefficient measurement in section 4, where we 
neglect coherence terms of the density matrix that are 
dephased by the measurement to obtain a semiclassical 
strategy. In section 5, we use the full density matrix for 
the system to derive a quantum strategy that is valid in 
the continuous measurement limit. In section 6 we com¬ 
bine the semiclassical and quantum protocols to develop 
a superior hybrid protocol that allows unit fidelity to be 
reached for arbitrary non-unit measurement efficiencies. 
Section 7 demonstrates the experimental feasibility of the 
continuous-time ASLO protocol with numerical simula¬ 
tions of remote entanglement generation in a realizable 
system of transmon qubits in spatially separated cavities. 
Section 8 summarizes and provides an outlook for future 
work. 


2. ENTANGLEMENT VIA MEASUREMENT 

To study feedback in the context of remote entangle¬ 
ment generation, we must first describe the measurement 
on which the feedback will rely. As analyzed in [34] , it is 
possible using a dispersive homodyne readout to imple¬ 
ment a half-parity measurement in the superconducting 
circuit architecture. In reference [4], authors succeeded 
in applying this measurement to two superconducting 
transmon qubits separated by over a meter. This mea¬ 
surement is characterized by the operator 


— 2 + ^22), (1) 

where is the Pauli operator acting on the ith qubit. 
This can be used to probabilistically generate entangle¬ 
ment by first preparing the separable uniform superpo¬ 
sition state IV^o) = 5 ( 100 ) -I- jOl) -I- jlO) -I- jll)) (e.g., by 
making two local Gy rotations on the ground state | 11 )) 
and then measuring X. Since this observable cannot dis¬ 
tinguish the states jOl) and jlO), such a measurement 
(if perfect) will with 50 percent probability project the 
initially unentangled \ipo) into the entangled triplet state 
|t0) = ^(|01) + |10)). 



Integrated Homodyne Voltage 


FIG. 1: Histograms of the integrated measurement signal 
for the three eigenstates pi of the measurement, given by 
Tr[flAv,i 7 pi flAVv?]- Plots are for for p = 1 (thin lines) and 
p = 0.2 (thick lines) and use the parameters At = 3/rs and 
k = 1 27rMHz. 


In the superconducting qubit architecture, the imple¬ 
mentation of the measurement is such that projection 
onto one of the eigenstates of X can be said to occur 
only after a resolvable time period. During this time in¬ 
terval one can then perform feedback on the system based 
on information already gained. The measured value of 
the observable X is obtained from a homodyne measure¬ 
ment of the voltage, V [4]. During the measurement, the 
time-dependent homodyne voltage that constitutes the 
measurement signal is specified by [35] 

dV, = {X){t)dt+^^, ( 2 ) 

where (•) denotes an expectation value under p(t), the 
state of the two qubit system, k is the strength of 
the measurement, 0 < 77 < 1 is the measurement ef¬ 
ficiency and dW(t) is a Wiener increment satisfying 
dW{t)dW{t') = 8{t — t')dt that represents the quantum 
noise in the homodyne detection [5]. Note that we have 
chosen dV to have units of time, so that the average signal 
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Ay = ^ / dV is unitless. Experimentally, the appro¬ 
priate scale factor can be extracted simply by preparing 
states with {X) = {1,0,—!} and measuring (AE) for any 
pair of these states. 

The state of the system conditioned on this continuous 
measurement record is given by the following stochastic 
master equation [5]: 

Pdv(d + dt) = p{t) + 'D[M]p{t)dt + 'H[M]p{t)y/rjdW{t); 

M = V^X, (3) 


measurement signal. The measurement operator corre¬ 
sponding to the observed fraction r] of the signal is given 
by 


n 


AV,J) — 




1/4 

exp 


-2rikXt{XV - X f 


( 5 ) 


After evolution by Eqn. (3) for a discrete time At, the 
two-qubit state conditioned on the finite voltage incre¬ 
ment Ay is then given by 


Here V[M] = MpM'f - l/2(Ml'Mp -h pM^M) and 
H[M]p = Mp + pM"^ — {M + M^)p (throughout this 
paper we use units such that h = 1). The second term in 
Eq. (3) represents the dephasing due to the measurement 
and the third term represents the information update of 
the system state derived from the measurement outcome. 
Note that equation 3 is expressed in the interaction pic¬ 
ture with respect to the free Hamiltonian for the two 
qubit system, Hq = - 

Eq. (3) generates a set of stochastic quantum trajectory 
equations that provides a description of the measurement 
conditioned dynamics in continuous time. In the follow¬ 
ing we will also be interested in the conditioned dynam¬ 
ics at discrete time intervals. In order to obtain such a 
discretized description of the system, we must compute 
the finite-time generalized measurement, or positive op¬ 
erator valued measure (POVM) [-5, 36] that is associated 
with the above continuous time weak measurement. Such 
finite-time POVM descriptions are generally difficult to 
compute for weak measurements. However in the case 
of quantum non-demolition (QND) measurements, for 
which [Hq,X] = 0, this can be readily derived. As shown 
in Appendix A, the finite time POVM for such weak QND 
measurements is composed of a set of effects E{AV) = 
[36], with Hav an operator corresponding to 
the measurement of voltage increment Ay during a fi¬ 
nite time interval At. These effects satisfy the comple¬ 
tion property / E{AV)d{AV) = J il\yV,/^Yd{AV) = 1 , 
where the integral is performed over the domain of the 
voltage increment. As shown in Appendix A, in the case 
of perfect measurement efficiency, ly = 1, the operator 
corresponding to the above weak measurement is given 
by 


Hav 


(4^) 


1/4 

exp 


-2kAt{AV - Xf 


( 4 ) 


The state update corresponding to a discrete time mea¬ 
surement outcome Ay is then given by the familiar re¬ 
lation p —)■ HAvpH^y/Tr[HAypH^y]. In order to model 
inefficient measurement, we note that a weak QND mea¬ 
surement with strength k and efficiency rj 1 can be 
viewed as a sequence of two weak measurements of the 
same observable, the first with strength kp and the sec¬ 
ond with strength fc(l — ry), where we integrate out the 
latter in order to model the loss of this portion of the 


PAv{t + At) = 

poo 

/ ^AV',l-r 


^AV.rj p(^) ^AV,r] 

Tr[HAv,,, pit) 


,1-rjdAV )■ 


( 6 ) 


Equations 3 and 6 define the time evolution of p under 
generalized measurement of the observable X for contin¬ 
uous and discrete time increments, respectively. In figure 
I, we plot histograms of the measurement outcomes for 
ry = 1 and ry = 0.2, at a fixed measurement time At. 
For an inefficient measurement, the variance of the out¬ 
comes is larger, which represents uncertainty due to loss 
or noise. For unit efficiency, there is a residual variance 
due to vacuum fluctuations of the readout signal, and 
overlap between the Gaussians due to this effect leads 
to the measurement being non-projective, or weak. The 
limit of projective measurement can be recovered by ei¬ 
ther letting the fc —> oo, i.e., making the measurement 
infinitely strong, or At —> oo, corresponding to an in¬ 
finitely long measurement time. As noted earlier, putting 
the system initially into the equal superposition state ipo 
and then measuring X can project the system into a max¬ 
imally entangled state, but the probability for achieving 
this by measurement alone cannot exceed 50%. In the 
following sections, we will see that feedback can increase 
this probability to 1. 


3. OPTIMAL ROTATION IN A SINGLE TIME 
STEP 

The only nonlocal resource in our deterministic entan¬ 
glement scheme is the half-parity measurement X that 
was introduced in the previous section. Due to the 
lack of direct qubit-qubit coupling, the remaining con¬ 
trol resource, the quantum feedback operations, will be 
restricted to local rotations that act on each qubit in¬ 
dividually. Specifically, we define the feedback unitary 
as 


Hf[0i,02,<(>1,<^2] =G(0i,</)i)GH(02,<A2) (7) 

where 


{7(6*, (p) = J 2 cos 0/2 — in{<p) • o'sin 0/2 

is a general single qubit unitary rotation (a is a 3-vector 
of the Pauli matrices, n{4>) is a real 3-vector of unit norm 
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and I 2 is the identity matrix). Given the starting point 
of the equal superposition state |V’o)i any target state 
lying within the triplet manifold may be obtained by ro¬ 
tations within a fixed 4> plane of the Bloch sphere of each 
qubit. Hence we set fi(0) ■ a = ctx cos (f) + (Jy sin^. We 
note that in the presence of dephasing one might wish 
to remove this restriction and additionally allow rota¬ 
tions, in order to introduce rotations between the target 
state \t0) = ^(|01) +110)) and the corresponding singlet 

|s) = ;^(|01) — 110)). However, our measurement oper¬ 
ator X is not capable of distinguishing these two states, 
and so does not yield any direct information determining 
when to apply this operation. Hence the present con¬ 
struction of optimal strategies will assume negligible de¬ 
coherence, although we shall retain the density matrix 
element pss throughout the analysis in order to study 
the impact of dephasing (see section 7). 

Our goal is to find the optimal values of 9i and (j)i 
as a function of time and measurement outcomes XV 
that will maximize entanglement. We choose the fidelity 
[37] of p with respect to the pure state pto = jtO) (tOj, 
i.e., T{a,pto) = Tr[^y^py^]2 = ptoto, as a proxy 
for entanglement rather than the concurrence [38] with 
this state, since the former yields a hgure of merit that 
is linear in the state p. We further simplify the setup 
by enforcing identical local feedback unitaries satisfying 
the properties 9i = 02 and (j)i = (j )2 = 7r/2. The triplet 
subspace is closed under local unitary rotations satisfying 
these constraints. We assume that the initial state is 
in the triplet subspace, so that we do not require more 
the general rotations that connect the singlet and triplet 
subspaces. This is consistent with initializing in the equal 
superposition state {ipo). 

In order to find the optimal feedback rotation after a 
measurement, regardless of whether this is in a discrete or 
infinitesimal time increment, we parameterize the density 
matrix as follows: 


Pt-t- 

Pt-to 

Pt-t+ 

Pt — s 

Pt-to 

Ptoto 

Ptot+ 

PtOs 

Pt-t+ 

Ptot+ 

Pt+t+ 

Pt + s 

-P*t-s 

PtOs 

Pt+s 

Pss 


where \t-) = | 00 ), \t+} = [ 11 ) and Js) = :^(| 01 ) - jlO)). 
Since we are restricting the feedback operations to ay 
rotations {(p = 7r/2) and our measurement is represented 
by a real matrix, we may further restrict p to be a real 
matrix. The fidelity with respect to jtO) after applying 
identical ay rotations on both qubits is given by 

F = {t0\pi\t0) = Ptoto + ^{V8sm29{pt-w - Ptot+) + 

(1 - cos20)(l - Sptoto - ‘2pt-t+ - Pss)), (8) 

where pi = Uf[9,9, f, f] p Up[9,9, f, f ]■ The optimal 
rotation angle 9 is then found by maximizing (tOjpijfO) 


over 9 and is given by, 


— 2 3ptoto+Pss + 2pt-t-i- —1]) 

( 9 ) 

where arctan[p, x] behaves as arctan[p/a;], but with 9 cho¬ 
sen in the correct quadrant, i.e., 


arctan[?/, x] 


arctan[p/a;] 
arctan[p/a;] -|- tt 
arctan[p/a;] — tt 
(p/|y|)7r/2 


a: > 0 
y > 0, a: < 0 
y < 0, a: < 0 
a: = 0 


( 10 ) 


3.1. Average-sense local optimality 


Eqn. (9) defines the optimal feedback as a function of 
the density matrix in a single time step. This relation 
will define a locally optimal protocol (sometimes referred 
to as a greedy strategy [39]), meaning that the controller 
maximizes the figure of merit at each time step How¬ 
ever this may not be viable for practical applications. 
Although the controller does in principle have access to 
the density matrix at every time step, actually calcu¬ 
lating p(t), e.g., using Eqn. (3) or Eqn. (6) amounts to 
dynamical state estimation, which may be too computa¬ 
tionally intensive to implement ‘on-the-fly’, i.e., in real 
time, and hence experimentally infeasible. Ideally, one 
would prefer to provide a protocol that does not require 
storing and manipulating the entire measurement record. 

In order to obtain an experimentally feasible protocol, 
we drop the requirement that a protocol be locally opti¬ 
mal, and instead search for one that is locally optimal in 
an average sense. Given a Markovian feedback protocol 
Up, we define the average evolution over a time interval 
containing measurement and feedback by 


p{t + Xt)= (11) 

J UplpAvit + At)]pAv{t + Xt)Up[pAvit + At)]d{AV) 


for discrete measurement, and by 


p{t + dt) = 


( 12 ) 


J UpiPdvit + dt)]pdv{t + dt)Ul[pdv{t + dt)]d{dV) 


for continuous measurement. For an ASLO protocol, 
Uf[p] = t7F[0opt[p],0opt[/o],7r/2,7r/2] and PAV(dV)it + 
At(dt)) is the average state after a measurement initi¬ 
ated at time t as generated by Eqn. (6) (Eqn. (3)) acting 
on p{t). For simplicity of notation, we shall refer to the 
measurement outcome as V, regardless of whether the 


^ The term “local” refers to time-local here, and not spatially local 
(as in local unitary rotation). 
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time interval is discrete or infinitesimal. Note that we 
are neglecting the time it takes to apply the unitary op¬ 
erations, so that only the measurement contributes to the 
time duration of the feedback process. 


Because of the integration over measurement out¬ 
comes, p{t) is not a stochastic quantity, and thus it may 
be explicitly computed as a function of time for a given 
feedback protocol Uf[pv] and initial state p{t — 0 ). (A 
controller would only have access to p if it applied feed¬ 
back but did not save the measurement record.) Because 
we have set Up io he a function of pv, we have implicitly 
restricted the feedback operation to depend only on the 
most recent measurement outcome V (t) and the current 
time t, given the known time evolution of p(t). Con¬ 
sequently the feedback is Markovian. The ASLO feed¬ 
back operation 0 opt is defined by replacing p with pv in 
Eqn. (9), and can be specified in a lookup table which can 
be computed beforehand for a given initial state. This is 
a substantial reduction in computational overhead, and 
as we will see in later sections, this lookup table can in 
some instances be implemented by a passive device such 
as a mixer (multiplier), essentially making the feedback 
autonomous. 


It is useful to analyze the symmetries of the frame¬ 
work that we have outlined above. Suppose we start in 
an initial state that is symmetric under the transforma¬ 
tion | 0 ) O | 1 ) on both qubits (henceforth referred to 
simply as a “01-symmetric” state). This symmetry im¬ 
plies that pt+t+ = Pt-t-, Pt-to = Ptot+ and (A) = 0. 
Measurement can stochastically break this symmetry by 
pushing the state toward |00) or |11). However, because 
the target state |< 0 ) also respects this constraint, feed¬ 
back will act equally and oppositely for these two cases 
(z.e., dopt(E) = —^opt(—E)), and thus this symmetry 
will be restored after integrating over V as in Eqn. (11) 
or Eqn. (12). Thus we may assume throughout that p 
retains this symmetry so long as the initial state does. 
Note that unlike both the target and equal superposition 
state which are both 01 -symmetric, the ground state | 11 ) 
is an example of a state that is not 01 -symmetric, and 
thus we have excluded a natural initial state for the prob¬ 
lem. However in this case it is locally optimal to apply 
a 7r/2 rotation on both qubits, after which we will be in 
the equal superposition state |' 0 o)- 


4. DISCRETE FEEDBACK AND THE 
SEMICLASSICAL PROTOCOL 


We now consider specific feedback protocols based on 
ASLO introduced in the previous section. In this section 
we consider the situation where the quantum efficiency of 
the measurement is small, i.e., ry <C 1 , which is a highly 
relevant scenario in many experimental settings. In this 
regime, measurement induced dephasing quickly reduces 
the off-diagonal elements of the density matrix to zero 
(see Eqn. (3)), so that the controller only has access to 
the classical probabilities associated with the three mea¬ 
surement eigenstates. Without knowledge of the coher¬ 
ences, we shall arrive at a semiclassical protocol. This 
will not only provide a useful comparison to the more 
general quantum protocol derived in the following sec¬ 
tion, but will also be important for developing a hybrid 
protocol for arbitrary, but not necessary small efficiency 
r] < 1 (section 6 ). 

In the limit of very small 77 , off-diagonal elements of 
p{t) will be approximately 0. To study feedback in this 
case, we explicitly set the off-diagonal elements to be 
some small quantities pt-to = Ptot+ = e and Pt-t+ = f' 
respectively, so that the first argument (y) of the arctan 
function in Eqn. (9), and thus arctan[y/a;] are approx¬ 
imately zero. If the second argument (cc) is positive 
ie., 3ptoiO,Ay-l-2e' « 3ptoto,Ay-l > 0 , then dopt = 0 
(assuming for simplicitly of the resulting equations that 
the singlet subspace is unpopulated). Using equation 
6 , we see that for an initially 01 symmetric state with 
fidelity ptoto; the hdelity ptoto.AV after measurement is 
given by 

Pmo,AV = ( 13 ) 

_ Ptoto _ 

+ lz»(g-4r,fcAt(l-H2Ay))(l + gl6r,fcAtAy) 

Because Eqn. (13) decreases monotonically away from 
AU = 0, the above inequality yields a threshold behavior 
for the feedback strategy, in which the preferred opera¬ 
tion is to do nothing unless |AU| exceeds some critical 
value Vr.opt- In this case {x < 0), then dopt = ±7r/2 
where the sign is chosen to match the sign of y. Using 
Eqn. (4) to calculate y, it is not difficult to show that 
the sign of y is the same as that of eAV^. The optimal 
threshold voltage is given as 


Ur,opt 


„ . arccosh -—^^^^^^expUnkAt) , 

SrjkAt 1 - Ptoto 


(14) 


While feedback strategies designed in this manner may 
not perform as well as those in which full dynamical state 
estimation is used at each time step, the added simplic¬ 
ity of average sense local optimality makes the resulting 
protocols substantially simpler to implement experimen¬ 
tally. Note that until Section 6 , none of the strategies 
we formulate attempt to surpass the performance of a 
locally optimal protocol. 


which defines the semiclassical protocol. Note that 
Ur.opt = 1/2 in the projective measurement limit, kAt ^ 
1. Eqn. (14) has a simple interpretation. If the state is 


^ If e is identically zero, then one may take it to be an infinitesimal 
positive or negative value when calculating Bopt- 
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already entangled with high probability, the controller 
does nothing. If the probability of being in either \t—) 
or |t+) is above a certain threshold, one applies a ±7r/2 
pulse to both qubits, which essentially resets the state to 
the product state jdO) + |1)) 0 ((|0) + |1)) = |'0o) and 
gives the joint measurement another chance to collapse 
to the entangled \t0) state. 

This strategy is classical in the sense that the opti¬ 
mal feedback could just as easily be calculated by using 
the classical Bayes rule to combine prior state knowledge 
with the information gathered from the previous mea¬ 
surement to determine whether it is beneficial to apply 
feedback or do nothing [40]. Note that the larger ptoto is, 
the larger Vr.opt and hence the wider the range of volt¬ 
age in which no operation is perfomed, [—Vr.opt; br.optj- 
Qualitatively, this is because as the fidelity with jtO) in¬ 
creases, it becomes more likely that the qubits are in the 
entangled state, and so new information needs to suggest 
with higher probability that the system is in \t—) or |t-|-) 
before applying a rotation is beneficial, consistent with 
the Bayesian interpretation. 

If one instead uses a fixed threshold value for feedback, 
one can analytically solve for the steady state fidelity 
under this protocol: 

J^ss = (t0|p(oo)|t0) = (15) 

^ 4erfc[^] 

erfcjilTidl] _|_ _|_ gj-fgjibzplij 

where a = l/^/AkrjAi is the standard deviation of the 
measurement operator, and erfc is the complement error 
function, 1 — erf(a;). We can see by taking limits of this 
expression that one can only reach unit steady-state fi¬ 
delity in the limit that Vr ^ cr, or in other words, the 
threshold voltage is much greater than the variance of 
the Gaussian weak measurement effect, Eqn. (5). 

The discussion above defines a viable locally optimal 
discrete feedback protocol using a half-parity measure¬ 
ment. A similar procedure using a full parity measure¬ 
ment has been used to deterministically entangle qubits 
located in the same cavity [26] , without analysis or proof 
of any optimality properties. 

5. THE CONTINUOUS-TIME CASE AND THE 
QUANTUM PROTOCOL 

If one attempts to derive a continuous-time protocol 
using the above result, the increase in fidelity becomes ar¬ 
bitrarily slow in the small At limit. This is problematic 
for implementations. The underlying issue is apparent 
from examination of the At ^ 0 limit of the threshold 
voltage in Eqn. (14) is taken. Vr.opt diverges as 1/At, but 
the standard deviation of the measurement outcomes di¬ 
verges more slowly, as l/\/Ai. (We note that the fact 
that these quantities diverge is an artifact of our nor¬ 
malization convention for AV. No physical observable 



FIG. 2: Discrete-time feedback simulations, showing fidelity 
with the I to) state as a function of time, starting from the 
maximally mixed state in the triplet subspace. For these sim¬ 
ulations, 77 = 0.1, k = 1 27rMHz and At = 20ps. The inset 
shows the optimal threshold voltage as a function of time. 
Note that for a smaller threshold, fidelity increases quickly at 
first but then saturates to a value significantly less than one, 
while for a larger constant threshold, fidelity increases slowly 
at first but then surpasses the former and approaches unity, 
though it does not asymptote to 1 (see Eqn. (15)). The locally 
optimal strategy, which increases the threshold as a function 
of time matches or surpasses both fixed-threshold strategies 
at all times. 


diverges in this limit.) Thus the probability that this 
feedback strategy will result in performing any opera¬ 
tion on the state vanishes. In order to derive a viable 
continuous-time protocol, we must therefore include co¬ 
herences and take the full density matrix into account 
instead of just the diagonal elements. 

We approach this by recognizing that Eqn. (9) itself 
defines an ASLO protocol and can therefore be used to 
derive both discrete and continuous-time feedback strate¬ 
gies directly, without setting the off-diagonal terms of 
p to zero. Fig. 3 shows the performance of this dis¬ 
crete time strategy in the case of perfect efficiency 77 = 1 
for various choices of At, and compares with the semi- 
classical protocol derived in the last section. Not sur¬ 
prisingly, Fig. 3 shows that the quantum protocol has 
strictly better performance over the semiclassical pro¬ 
tocol. However, the performance gap between the two 
protocols closes as At increases. The reason for this is 
that both the measurement-induced dephasing and the 
time interval between feedback operations increases as 
At increases, so that the density matrix for which the 
feedback is calculated becomes closer and closer to a di¬ 
agonal form. One can see this behavior explicitly in Fig. 
3 (b), in which we plot the applied feedback as a function 
of the measurement outcome. At late times, it resembles 
the semiclassical protocol. We will revisit this point in 
section 6 . 

The above example still constitutes discrete feedback. 
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FIG. 3: (a) Discrete feedback simulations, showing fidelity 

with the I to) state as a function time under the semiclassi- 
cal and quantum feedback (Eqn. (9)) strategies, starting in 
the equal superposition state \iPq) and assuming a unit effi¬ 
ciency measurement and fe = 1 27rMHz. The performance 
also changes according to the discrete time step. At: we show 
two representative cases here, (b) The feedback 9opt{V) ap¬ 
plied by the quantum protocol shown in panel (a) for the 
case At = Q.2/k at four distinct time steps. At the latest 
time step, 0opt resembles the semiclassical protocol because 
the off-diagonal elements of the density matrix have decayed 
to zero. 


in which the measurement and feedback unitary rotations 
act sequentially. To derive a continuous protocol, we take 
the measurement strength to be small and assume that 
infinite strength rotations are available. We show below 
that this unphysical assumption will be unnecessary, but 
it is convenient for the initial derivation. 

Given that p is 01-symmetric for any evolution starting 
from an initially 01-symmetric state, dV oc dW because 
{X) = 0. Furthermore, inspection of Eqn. (3) shows that 
the quantity pt-to,dv - Pm+,dv appearing in the first 
argument of the arctan function in Eqn. (9) is infinites¬ 
imal and proportional to dW. We can then substitute 
the continuous-time measurement update in Eqn. (3) into 


Eqn. (9), and expand in a Taylor series with respect to 
Pt-to,dv — Ptot+,dv oc dW. Assuming that the second ar¬ 
gument in the arctan function is greater than zero^, this 
yields a proportional feedback strategy in which the feed¬ 
back rotation is proportional to the measurement out¬ 
come dW via 


^opt - 


_ Ay/kqpt-to _ 

3ptOtO + Pss + ‘2pt-t+ — 1 


dW, 


or equivalently, 


Popt{t) = 


dopt 

~dV 


_ S^/2kr]pt-to _ 

3ptoto + Pss + 2pt -t+ — 1 


(16) 


(17) 


This shows that the ASLO feedback is in fact direct 
feedback in the continuous-time limit, where the feed¬ 
back rotation angle is directly proportional to the mea¬ 
surement value. This is the type of feedback is modeled 
by a Wiseman-Milburn feedback master equation [5, 41] 
given in this instance by 

dp = 'D[M]pdt + 'H[M]p^dW - i[Hp, p] 

(18) 

with 

Hp = —-I- ay2)- (19) 


dp may be calculated as before using the averaged ver¬ 
sion of Eqn. (18) i.e., integrating out dW. Substituting 
Eqn. (17) into Eqn. (18) then yields the equations of mo¬ 
tion for the state under the ASLO quantum, continuous¬ 
time protocol. While this equation is difficult to solve in 
the general case, for 77 = 1 it admits an analytic solution 
when the initial state is pure and satishes the usual sym¬ 
metry property. To find this solution, we take the trial 
solution to be pure and 01 -symmetric, so that p = \il)) (i/)] 
with 


\m) = ^ptotoW, 


l—ptOto{t) 

2 • 



Substituting this form into Eqn. (18) yields the following 
differential equation for ptoto 

Ptoto) Ptoto{t) = 1—(1—Ptoto(0))e 

( 21 ) 


^ If we start in the separable state (|0) + |1})(S>(|0) + |1)) as is most 
practical, this constraint will hold true for all later times. If we 
start in a state that does not satisfy this property, it is locally 
optimal to first apply a n/2 rotation on both qubits, after which 
the second argument will be greater than zero. 





































Crucially, the terms in Eqn. (18) that depend on dW 
cancel upon substitution. This cancellation ensures that 
the actual state equals the average state at all times 
*.e., p{t) = p{t), so that the average state evolution is 
pure under this feedback protocol. This implies the non- 
averaged evolution is also deterministic, and thus dynam¬ 
ical state estimation is not necessary to implement the 
locally optimal strategy in this special case. 

Having a solution for the evolution of the full density 
matrix in the case rj = I also yields an analytic solu¬ 
tion for the optimal proportionality coefficient between 
feedback and measurement, Hopt, as a function of time: 


p p-. ^ 4fc(l-ptoto(0)) 

\/(l ~ Ptoto( 0 ))(Ptoto( 0 ) — 1 + 


( 22 ) 



This equation displays a marked similarity with the 
optimal feedback for single qubit purification [9], with 
the functional form differing only by two minus signs in 
distinct locations. Like this quantum protocol for pure 
states, the single qubit state evolution under optimal 
feedback is also deterministic, and has been shown to 
be globally optimal for r; = 1 [9, 12, 14]. These parallels 
lead us to speculate that the protocol given by Eqn. (22) 
may be globally optimal as well. In the case of non-unit 
measurement efficiency, ptoto appears to exactly follow 
an exponential which asymptotes to a value less than 1 . 
However, we have not been able to find an analytic solu¬ 
tion. 

Fig. 4 shows the fidelity as a function of time using the 
quantum protocol for p = 1, comparing the performance 
of the ASLO protocol with time dependent Popt(t) (blue 
line) to that obtained by using a constant multiple of 
the zero time value Popt ( 0 ) (red and yellow-dotted lines). 
The superiority of the ASLO protocol is evident. 

It is useful to study the asymptotic behavior of these 
protocols. The numerical simulations show that with the 
ASLO strategy, the fidelity asymptotes quickly to one, 
while using a constant multiple of Popt ( 0 ) does not reach 
unit fidelity. This raises the question of whether a spe¬ 
cific fixed value of P could also yield unit fidelity. By 
fixing dp = 0 in Eqn. (18), we obtain a system of linear 
equations which can be solved to yield the steady state 
fidelity for fixed P: 

_ + Wk^yjl + 8p) 

ss./ 3p2 _|_ i6fc2p(3 -I- 8p)' 

This result is valid for P 7 ^ 0 (when P = 0, any state 
that commutes with the measurement is a steady state). 
For large P, the steady state fidelity tends to 1/3, while 
in the limit P —)■ 0, the steady state fidelity is given by 
(1 -I- 8p)/(3 -I- 877 ). Moreover, since the denominator in 
Eqn. (23) is greater than the numerator for all values of 
parameters, Pssj < 1- This analysis proves that it is not 
possible to obtain unit fidelity with a constant value of 
P, and therefore a time-varying direct feedback protocol 
is necessary in order to reach unit fidelity at long times. 


FIG. 4: Continuous feedback simulations, showing fidelity 
with the I to) state, P, as a function of time under the 
continuous-time ASLO protocol given in Eqn. (17). The ini¬ 
tial state is the separable state I'i/'o)- The simulation param¬ 
eters are k = 1 27rMHz, rj = 1, with time step dt 1/k. 
Fidelity as a function of time for several values of constant 
proportionality coefficient P are also shown for comparison. 
Inset shows the steady state fidelity Pssj achieved for con¬ 
stant P, according to Eqn. (23). Since we have taken the 
small dt limit, the threshold strategy would not change from 
its fidelity at t = 0. Note that if we were to plot the locally op¬ 
timal strategy in this continuous time situation, which would 
nomically require dynamical state estimation, it would coin¬ 
cide exactly with the curve for Popt (t) • 


This quantum protocol has several appealing features 
for experimental realization. Firstly, proportional feed¬ 
back is realizable simply by using a mixer or analog mul¬ 
tiplier [ 8 ] , both of which have very low latency. Further¬ 
more, unlike the semiclassical protocol, only infinitesimal 
rotations are called for in any given time step, which re¬ 
duces the resources necessary for implementation. 


6. INEFFICIENT MEASUREMENT AND 
HYBRID PROTOCOLS 

For unit efficiency, the fidelity quickly reaches 1 when 
continuous-time feedback is used. But for 77 < 1, the 
fidelity asymptotes to a value less than 1 (see Fig. 5). 
Qualitatively, this happens because the off-diagonal el¬ 
ements of p that drive feedback in the continuous-time 
case (see Eqn. (17)) decay faster relative to the feedback 
terms which increase the fidelity. In contrast, if we imple¬ 
ment discrete-time feedback for the same value of 77 < 1 , 
the fidelity increases more slowly at first, but eventu¬ 
ally surpasses the asymptotic fidelity of the continuous¬ 
time strategy. Moreover, we know from Sec. 4 that the 
semiclassical strategy is unaffected by decay of the off- 
diagonal terms, and this strategy does reach a fidelity of 
1. However this strategy is only viable when the measure¬ 
ment time is finite. These facts suggest that we consider a 
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hybrid protocol that transitions between continuous-time 
and discrete-time feedback; i.e., has a variable measure¬ 
ment duration. 

To determine how to make this transition, we perform 
a numerical optimization of the measurement durations 
as follows. We divide the system evolution into 200 dis¬ 
crete steps each of duration Ati subject to the constraint 
Ati = Tfinai- At each time step, discrete-time mea¬ 
surement, according to Eqn. (5), followed by feedback 
according to the general optimal function Eqn. (9) is ap¬ 
plied. Then an optimization over all time intervals Ati 
is performed (using the gradient descent algorithm) to 
minimize the cost function 1 — (t0|p(rf}nai)|i0). 

The results of this optimization are shown in Fig. 5, 
where we plot the resulting fidelity as a function of time, 
and compare to evolutions in which all Ati are fixed to 
a constant finite or an infinitesimal value. The opti¬ 
mization consistently finds a minimum in which the ma¬ 
jority of the Ati intervals are small and approximately 
equal, while a few at the end are large. In other words, 
the optimal solution found by gradient descent shows 
a sharp transition between continuous-time feedback at 
short times to discrete-time feedback at long times. Af¬ 
ter the switching time, the off-diagonal elements are ob¬ 
served to be small, and thus the applied feedback closely 
resembles that of the semiclassical protocol. 



FIG. 5: Fidelity IF starting from the state as a function 
of time under the optimal variable time step protocol (blue 
circles), compared to fidelity obtained with constant time step 
protocols (discrete time semiclassical protocol (red crosses), 
continuous time quantum protocol (yellow dotted line)). The 
measurement is on continuously at a constant rate and feed¬ 
back is applied instantaneously at each point in the graph. 
Parameters fc = 1 27rMHz and rj = 0.4 are chosen to illus¬ 
trate the benefit of this optimized strategy for non-unit but 
not too small efficiency. The dense cluster of points before 
t = 2pLS shows that the gradient search resulted in almost 
all of the times for application of feedback to be located in 
the early stages of the evolution. Also shown is the fidelity 
obtained from a full non-Markovian strategy using dynamical 
state estimation and locally optimal feedback (see text). 


Since the measurement rate k is held constant in the 
above optimizations, an alternative way to view the 
change in optimal measurement time step is to view the 
finite duration measurements as a series of infinitesimal 
measurements. This perspective lends itself to the in¬ 
terpretation that when the hybrid protocol uses discrete 
feedback, it abstains from applying feedback at one time, 
only to apply a stronger feedback at a later time, and 
thus is not locally optimal. While we cannot prove any 
optimality properties of this hybrid protocol, its superior 
numerical performance indicates that it is significantly 
closer to global optimality than both of the fixed At pro¬ 
tocols derived in previous sections. 

Fig. 5 also shows that there exists a non-Markovian 
strategy which outperforms the hybrid protocol described 
above. Simulating continuous feedback and averaging 
over 300 trajectories, we plot the fidelity as a function 
of time assuming that the controller performs dynami¬ 
cal state estimation at each time step and then applies 
the optimal feedback unitary, Eqn. (9). Because simula¬ 
tion of trajectories tracks the actual state as opposed to 
the average state, this protocol is locally optimal, as op¬ 
posed to ASLO. This strategy is found to surpass the hy¬ 
brid protocol at both early and late times, showing that 
for ?7 < 1, dynamical state estimation can yield a bet¬ 
ter protocol than one based only on the average state. 
However, as discussed earlier (see section 3.1), such a 
non-Markovian protocol is generally more difficult to im¬ 
plement in practice. 


7. EXPERIMENTAL REALIZATION 

In any experimental realization of the proposed feed¬ 
back schemes to achieve entanglement between remote 
qubits, numerous imperfections will complicate the dy¬ 
namics studied above. Firstly, finite coherence of the 
qubits will limit the fidelity. Second, the above results ap¬ 
ply only to truly Markovian dynamics, in which the con¬ 
troller acts instantaneously with an action that is based 
solely on the most recent measurement outcome. In prac¬ 
tice, the homodyne measurement will have finite band¬ 
width and the feedback will necessarily act with some 
finite delay. In this section we present simulations of the 
continuous-time quantum feedback protocol specified in 
Eqn. (17), in which these imperfections are now incorpo¬ 
rated. We use the full stochastic master equation includ¬ 
ing finite detector bandwidth that is derived in [42], 

dp = 'D[M]pdt + 'H[M]p^ridW + Cqdt 


nOO 


/ 

/to 

JUjuj \ _ 

Vv 


( 24 ) 
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where >Cq models relaxation Ti and dephasing on both 
qubits 





25[(T2,i]p + 



V[ai]p 


(25) 


Notice that the term in square brackets in Eqn. (24) is 
simply dV. Simulations are for experimental parameters 
77 = 0.4, dephasing times = 6.9pls/2tt and = 
30 ^ 3 / 2 ^ for the first and second qubits, respectively, re¬ 
laxation time Ti = 20p,s/27r [4], feedback delay to = 
100ns, feedback loop bandwidth l/r = 1.6 27rMHz [ 8 ] 
and measurement rate k = 1.3 27rMHz. The effective de¬ 
phasing times are obtained from combining an intrinsic 
qubit dephasing time of 3077 s/ 27 r with a loss of 0.04 in 
amplitude units between the cavities due to the circula¬ 
tor, which amounts to an effective az measurement on 
the first qubit by the environment [4]. 

To apply the average state feedback protocols, we as¬ 
sume that the effects of delay and finite bandwidth, the 
non-Markovian effects, are small, so that Popt is un¬ 
changed after neglecting them. In this limit, we can 
then first run a simulation to calculate the average evo¬ 
lution p{t) using the continuous time quantum proto¬ 
col, Eqn. (12). This simulation incorporates decoherence 
but not the feedback delay or finite bandwidth effects. 
From this first simulation, we extract Popt(t) and we then 
apply this feedback coefficient to the stochastic simula¬ 
tions using Eqn. (24), which are able to incorporate non- 
Markovian effects. This average performance is obtained 
here by averaging over 1500 stochastic trajectories. For 
these particular experimental parameters, dephasing pre¬ 
vents the fidelity from reaching the switching point ob¬ 
served above, which was confirmed by applying the same 
optimization as that of section 6 but including the above 
dephasing and decay rates. Therefore in this case, our 
hybrid strategy dictates that we apply only continuous 
feedback. 

Fig. 6 shows the time dependence of the fidelity and 
concurrence resulting from this strategy, i.e., applying 
the continuous time ASLO “quantum” feedback proto¬ 
col in the presence of the seadditional experimental im¬ 
perfections (solid blue lines). Figure 6 shows that ex¬ 
isting technology is adequate to implement our proposal 
and to deterministically entangle two transmon qubits, 
since both the fidelity and concurrence achieved substan¬ 
tially surpass the corresponding entanglement thresholds 
to a significant degree. Although the average state feed¬ 
back protocols derived here are motivated by the goal 
of achieving deterministic entanglement generation, one 
can also add post-selection to this protocol even though, 
unlike the situation in measurement induced entangle¬ 
ment without feedback [4], it is not essential here. The 
red dotted lines in Figure 6 show the effect on the fidelity 
and concurrence of using a 50% post-selection, in which 
only the trajectories for which the absolute value of the 
integrated signal from 0 to 2.7ps is less than the median 
are retained. The value of 2.7ps was chose to optimize 
the peak fidelity. This post-selection method does not 



time [ps) 



time [ps) 


FIG. 6: Continuous feedback simulations, showing fidelity 
with the I to) state as a function of time under the continuous¬ 
time ASLO protocol prescribed by Eqn. (17), and incorporat¬ 
ing experimentally realistic parameters and imperfections as 
described in the text. The hdelity reaches a much lower peak 
value than it would in the ideal case before decaying due to 
finite dephasing. Despite this, the fidelity substantially sur¬ 
passes 50%, which is the threshold for entanglement. To di¬ 
rectly quantify entanglement, we also plot the concurrence 
[43] as a function of time. Dashed and dotted curves show 
fidelity using a 50 % post-selection criteria with and without 
feedback, respectively. 


require any advanced processing of the signal and is seen 
to further enhance the fidelity and concurrence. We also 
plot the fidelity and concurrence using the same post¬ 
selection criteria but without feedback, which confirms 
that feedback results in an improvement. 

In the experiment of Ref. [ 8 ] , the propagation through 
cables introduced the largest contribution to delay. Our 
results show that this delay has a measurable effect on 
the results of the simulation. It is conceivable that the 
delay could increase as the physical separation between 
the qubits is increased. It is useful to enquire whether our 
feedback protocol remains viable in this context. The an¬ 
swer to this question lies in the fact that the measurement 
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strength k effectively sets the timescale for all feedback 
dynamics, as is evident in Eqn. (18). Making the substi¬ 
tutions k — 7 > k/s, dt dts and dW —>■ dWjy/s leads to 
recovery of the same equation. Thus the effect of feed¬ 
back delay and finite bandwidth can be removed simply 
by reducing the measurement strength. This has the ef¬ 
fect of slowing down all the dynamics. Of course, one still 
encounters a limit set by the other absolute time scales 
inherent to the problem, in particular, the time scales for 
Ti and T 2 processes. Nevertheless, this heuristic scaling 
argument shows that one can always choose k to strike 
a balance between delay and dephasing in order to opti¬ 
mize fidelity, and that it is only the effective coherence 
times that will fundamentally limit the achievable entan¬ 
glement. 

In practice, increasing the distance between the qubits 
will also increase photon loss between cavities. Since the 
measurement probe has not yet interacted with the sec¬ 
ond qubit at this point, loss between cavities introduces 
additional dephasing on the first qubit. Such dephasing 
was seen to be significant in the experiment of Ref. [4] , in 
which photon loss was dominated by the microwave cir¬ 
culator. Until better circulators are made, this source of 
loss is also likely to present a significant limitation to the 
maximum achievable entanglement using this scheme. 


8. CONCLUSION 

In this work we have developed several new protocols 
for remote entanglement generation using measurement- 
based feedback that do not require real time quantum 
state estimation. We introduced the notion of average 
sense locally optimal (ASLO) protocols, in which the 
feedback operations are determined at a specific (local) 
time by maximizing the fidelity of the average state after 
a discrete or infinitesimal measurement with respect to 
the target state. Using this approach, we derived a quan¬ 
tum feedback scheme with local unitary operations that 
allows deterministic generation of entanglement when 
the measurements can be performed with unit efficiency 
(ry = 1). The time local measurement averaging results in 
a Markovian feedback that is applicable to both discrete 
and continuous time implementation. In the continuous 
time limit, the optimal ASLO quantum feedback becomes 
equivalent to simple proportional feedback, which is eas¬ 
ily modelled using a Wiseman-Milburn equation [-5, 41] 
and which may be realized in an autonomous fashion in 
experiments using a mixer (multiplier) [ 8 ]. The ASLO 
strategy was then used to develop a discrete time step, 
semiclassical protocol, suitable for low measurement effi¬ 
ciencies and large time steps, in which only the classical 
probabilities for being in the entangled or unentangled 
states are taken into account. Analysis of the asymptotic 
behavior of both quantum and semiclassical protocols led 
to the development of a hybrid strategy which transitions 
from the quantum protocol at early times to the semiclas¬ 
sical strategy as the target state is approached at longer 


times, with an accompanying change in time step at the 
switching point. We demonstrated that such a hybrid 
strategy can be beneficial for the general case of inter¬ 
mediate measurement efficiency, 77 < 1 , with numerical 
optimizations and found strong evidence that the hybrid 
strategy is significantly closer to global optimality than 
any fixed time increment ASLO protocol. 

The ASLO feedback strategies developed here possess 
interesting relationships to the locally optimal strate¬ 
gies for single qubit purification by measurement and 
feedback [14]. In particular, the semiclassical protocol 
bears some resemblance to the locally optimal strategy 
for qubit purification in the small rj limit, while the quan¬ 
tum protocol at unit measurement efficiency also bears a 
striking resemblance to the corresponding optimal feed¬ 
back for qubit purification in Ref. 14. The advantages 
of combining two different strategies tailored to differ¬ 
ent measurement efficiencies in a single hybrid strategy 
were also observed in single qubit purification, although 
here we have taken the additional step of optimizing the 
transition time. 

We investigated the performance of these ASLO proto¬ 
cols for generation of remote entanglement between su¬ 
perconducting qubits with calculations based on exist¬ 
ing superconducting 3D transmon technology. We found 
that even under the current conditions of relatively low 
measurement efficiency, the ASLO protocols can deter¬ 
ministically generate amounts of entanglement substan¬ 
tially surpassing the entanglement threshold [44]. In 
contrast, the known methods of heralded entanglement 
based only on measurements are always probabilistic. 
We further showed that in the presence of low measure¬ 
ment efficiency, one can also use post-selection to fur¬ 
ther enhance the fidelity and we therefore developed a 
simple scheme based on the integrated signal to further 
enhance the fidelity to the target entangled state. This 
analysis highlights an interesting advantage of this re¬ 
mote entanglement generation scheme over others, such 
as those based on single photon counting and the Hong- 
Ou-Mandel effect [45, 46]. The ability to use feedback in 
our setup allows one to enhances the probability of suc¬ 
cess in a way that is not possible in these other heralded 
entanglement schemes. While the analysis presented in 
this work most immediately applies to superconduct¬ 
ing qubits in microwave cavities, it could of course be 
adapted to any system in which one can implement a 
weak measurement of the half-parity observable defined 
in section 2 . 

This work suggests several future avenues for research 
that would be interesting from both a theoretical and an 
applied point of view. First, one could apply the veri¬ 
fication theorems to address the question of whether or 
not our protocols are globally optimal [47]. In the case 
of unit efficiency and no decoherence, we speculate that 
our solution is globally optimal, although we have not 
attempted to prove this statement. The deterministic 
evolution of the state under the quantum ASLO proto¬ 
col indicates promise for this, in part because this fea- 
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ture, i.e., the deterministic evolution, is also observed 
in the globally optimal protocol for single-qubit purifi¬ 
cation. Second, while fidelity is a useful figure of merit 
for generating known states, concurrence may be more 
suitable for a detailed theoretical study and could yield 
additional insights. For instance, unlike fidelity, concur¬ 
rence is invariant under the allowed feedback Hamiltoni¬ 
ans, mirroring the behavior of entanglement. It would be 
useful to see whether this shared symmetry could be ex¬ 
ploited. Finally, while we have ignored dephasing in the 
development of the ASLO feedback protocols, in apply¬ 
ing the protocols to the realistic experimental setting for 
superconducting transom qubits, we found that dephas¬ 
ing presented the main limitation to performance. This 
suggests that it would be useful to investigate related pro¬ 
tocols that would focus on correcting dephasing errors, 
and hence assist with remote entanglement stabilization. 
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Appendix A: Derivation of finite time POVM 

As discussed in the main text, the stochastic mas¬ 
ter equation (SME) associated with a continuous-time 
QND measurement of the Hermitian observable X with 
strength /c, on a system with free Hamiltonian H is: 

dp = — j[iF, p\dt + 2k'D[X]pdt + V^'H[X]pdW(t). (Al) 


unnormalized density matrix. Further, since this equa¬ 
tion is linear we can restrict our focus to its action on 
pure states (since any density matrix is a convex sum of 
pure state density matrices). This defines the associated 
linear stochastic Schrodinger equation (SSE) [48]: 


d\'ip') = {—iH — fcA^) dt + V^dW{t)X 


(A4) 


This is again a linear equation in the state, 

Both the linear SME and linear SSE sacrifice the nor¬ 
malization of the resulting state for linearity. To see what 
this means physically, note that both equations are con¬ 
sistent with {X) = 0, and hence dV{t) = , which 

means that we are generating conditional dynamics ac¬ 
cording to statistics associated with some fictitious (time- 
independent) state with property {X) = 0. The real state 
\il)) may not have this property and this is why the nor¬ 
malization is incorrectly predicted by Eqn. (A4). 

Despite this issue with unnormalized states, the utility 
of Eqn. (A4) is that it is sometimes possible to analyt¬ 
ically solve for |'0), see for example Ref. [50]. In fact, 
the linear SSE in Eqn. (A4) is in the easiest class of such 
equations to solve since all the operators in it commute. 
We are interested in the solution to Eqn. (A4) over a 
small finite measurement time At. This can be solved 
explicitly and is given by [50]) 

|W(< + At)) = |^(t)) 

^ ^-iHAt^-2kX^At+4kXAV 

= 0,Av\'ip{t)), (A5) 

for any initial state |V'(t)) (including normalized states), 

and where AIU = dW{t'), is a Gaussian random 

variable. In the second line we have used the relation 
between the measured voltage and AW to write the so¬ 
lution in terms of the measured quantity, AU, which has 
probability distribution 


Pav 



4kAv‘^ 

At 


(A6) 


The superoperators V and P are defined in the main 
text. We assume that the measurement is unit efficiency, 
and since it is QND, [H, X] = 0. This equation describes 
the evolution of the system conditioned on the measured 
voltage 

dV{t) = {X)it)dt+^^. (A2) 

The linear stochastic master equation [48-50] associ¬ 
ated with this equation is: 

dp = -i[H, p]dt + 2kV[X]pdt + V^'H[X]pdW{t), (A3) 


Eqn. (A5) generates states that do not quite yield correct 
predictions since the normalization of the state is incor¬ 
rect. Consider a normalized initial state {iplt)). Then 
the probability of evolving during the measurement time 
according to the stochastic variation AU to time t + At 
is not actually ('0(t -I- At)\'ijj{t + At)), but rather [48-50] 

PAv{'ip{t + At)\'tlj{t + At)) = Pav |'0(i)) ■ 

This expression immediately tells us that the measure¬ 
ment operator associated to the true finite time evolution 
is 

Uav = V Pav^av, (A7) 


with P[A]p = Ap -I- pAb While Eqn. (Al) is non¬ 
linear in p and produces a normalized density matrix 
(i.e., tr(dp) = 0), Eqn. (A3) is linear in p and produces an 


since the true probability of this stochastic evolution is 
('0(i)| Eav l'!/’(^)) for the positive-valued effect Eav = 
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Now, solving for this effect for the QND measurement, 
we get 


^AV 



^ -2kX^At+4kXAV 



^ ^-iHAt^-2kAt{^-X)\ 


(AS) 


Here the integral on the third line is a Gaussian integral 
that can be evaluated by completing the square in the 
exponent. 


Hav is the finite-time measurement operator associated 
with the rj = 1 weak QND measurement used in the 
main text (with H = 0), and E/^y = is its 

corresponding effect. 

For completeness we can also show that the effects de¬ 
fined above constitute a POVM since. 


d{AV) dEAv 



d{AV) H^yHAv 

diAV) p^ye-4fc^"At+8fcXAy 


J°°d{AV) 




^AkX^At 


(A9) 
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